#> ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
#> ✔ dplyr     1.1.4     ✔ readr     2.1.5
#> ✔ forcats   1.0.0     ✔ stringr   1.5.1
#> ✔ ggplot2   3.5.0     ✔ tibble    3.2.1
#> ✔ lubridate 1.9.3     ✔ tidyr     1.3.1
#> ✔ purrr     1.0.2     
#> ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
#> ✖ dplyr::filter() masks stats::filter()
#> ✖ dplyr::lag()    masks stats::lag()
#> ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
#> 
#> Attaching package: 'plotly'
#> 
#> 
#> The following object is masked from 'package:ggplot2':
#> 
#>     last_plot
#> 
#> 
#> The following object is masked from 'package:stats':
#> 
#>     filter
#> 
#> 
#> The following object is masked from 'package:graphics':
#> 
#>     layout

1. Introduction

IS NUCLEAR GENERATOR A GOOD SOURCE OF ELECTRICITY POWER? IF NOT, WHAT ARE SOME GOOD ALTERNATIVES?

In this data set, we will be exploring a good source of power which we can use for a long time. The power source we will be exploring and analyzing will be Nuclear Power. Nuclear power is a form of energy production that harnesses the energy released from nuclear reactions. Nuclear power plants typically use fission to generate electricity. They are known for providing large amounts of continuous power and have low greenhouse gas emissions during operation.

For a step-by-step process on what will be done in this analysis, we will first explore which country will produce the most nuclear energy and see how much electricity is produced from them throughout the years on average. Secondly, we will see from those countries, what other source of power are they using which can be a good/viable alternative power source if Nuclear Energy isn’t working. Finally, we will see the trend of Nuclear energy across 10 years which is a good time frame to analyze the rise and fall of it and see if it will hold up for use in the future.

2.1 Data Summary

World Nuclear Generation

Firstly, wee will see the top countries which uses Nuclear Energy and see their electricity generation and electricity share.

#> Rows: 9603 Columns: 4
#> ── Column specification ────────────────────────────────────────────────────────
#> Delimiter: ","
#> chr (1): Entity
#> dbl (3): Year, electricity_from_nuclear_twh, share_of_electricity_pct
#> 
#> ℹ Use `spec()` to retrieve the full column specification for this data.
#> ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
#>        Entity Year electricity_from_nuclear_twh share_of_electricity_pct
#> 1 Afghanistan 2000                            0                        0
#> 2 Afghanistan 2001                            0                        0
#> 3 Afghanistan 2002                            0                        0
#> 4 Afghanistan 2003                            0                        0
#> 5 Afghanistan 2004                            0                        0
#> 6 Afghanistan 2005                            0                        0
#>     Entity               Year      electricity_from_nuclear_twh
#>  Length:9603        Min.   :1965   Min.   :   0.00             
#>  Class :character   1st Qu.:1987   1st Qu.:   0.00             
#>  Mode  :character   Median :2003   Median :   0.00             
#>                     Mean   :2000   Mean   :  88.80             
#>                     3rd Qu.:2013   3rd Qu.:   3.97             
#>                     Max.   :2023   Max.   :2762.24             
#>                                                                
#>  share_of_electricity_pct
#>  Min.   : 0.000          
#>  1st Qu.: 0.000          
#>  Median : 0.000          
#>  Mean   : 5.803          
#>  3rd Qu.: 2.461          
#>  Max.   :88.011          
#>  NA's   :2457
#> 'data.frame':    9603 obs. of  4 variables:
#>  $ Entity                      : chr  "Afghanistan" "Afghanistan" "Afghanistan" "Afghanistan" ...
#>  $ Year                        : num  2000 2001 2002 2003 2004 ...
#>  $ electricity_from_nuclear_twh: num  0 0 0 0 0 0 0 0 0 0 ...
#>  $ share_of_electricity_pct    : num  0 0 0 0 0 0 0 0 0 0 ...

3.1 Data Preprocessing Data Set

Missing Data Checking and Filling

#> [1] 2457
#>                       Entity                         Year 
#>                            0                            0 
#> electricity_from_nuclear_twh     share_of_electricity_pct 
#>                            0                         2457

From the data shown above we can conclude that all of the mssing data from the ‘World Nuclear Generation’ dataset are in the ‘share_of_electricity_pct’ column. From this data, we decided to fill all this n/a value with 0 since we can’t delete it or it will significantly alter the dataset and we can’t fill it with mean value or it will also alter our dataset.

#> [1] 0
#>                       Entity                         Year 
#>                            0                            0 
#> electricity_from_nuclear_twh     share_of_electricity_pct 
#>                            0                            0

Duplicate Variables Checking

#> [1] 0

Here, we can see that there are no duplicate variables which means that we don’t need to clean our data from any duplicate variables.

4.1 Data Exploration

#> # A tibble: 6 × 3
#>   Entity         avgElectricityFromNuclear avgElectricityShare
#>   <chr>                              <dbl>               <dbl>
#> 1 Afghanistan                         0                   0   
#> 2 Africa                              7.55                1.50
#> 3 Africa (EI)                         7.55                1.50
#> 4 Africa (Ember)                     12.3                 1.85
#> 5 Albania                             0                   0   
#> 6 Algeria                             0                   0

Average Electricity Generation and Share

Electricity Generation

#> # A tibble: 6 × 2
#>   Entity                avgElectricityFromNuclear
#>   <chr>                                     <dbl>
#> 1 G20 (Ember)                               2477.
#> 2 OECD (Ember)                              2074.
#> 3 World                                     1745.
#> 4 G7 (Ember)                                1632.
#> 5 High-income countries                     1449.
#> 6 OECD (EI)                                 1441.

We want to calculate the top 5 countries with the most average electricty from nuclear generation and since the top 5 are not countries, we can drop them and redo the dataframe.

#> # A tibble: 6 × 2
#>   Entity                avgElectricityFromNuclear
#>   <chr>                                     <dbl>
#> 1 Europe (Ember)                            1167.
#> 2 North America (Ember)                      881.
#> 3 Europe                                     800.
#> 4 Europe (EI)                                687.
#> 5 North America (EI)                         611.
#> 6 North America                              594.

Here we can see that United States, France, Russia, Japan and Germany are the countries with the most electricity from nuclear energy on average. There are more generation from each country but we want to see the top 5 from the whole country. Thus we pick those 5 as the top.

5.1.1 Statistical Analysis
#> No trace type specified:
#>   Based on info supplied, a 'bar' trace seems appropriate.
#>   Read more about this trace type -> https://plotly.com/r/reference/#bar

Here we can see that the United States produced or used the most Nuclear Energy with a whopping 529 MW of energy on average through the years. It is then followed by France in 2nd, Russia in 3rd, Japan in 4th and Germany in 5th. We can see these are top countries which produced the most electricity power through Nuclear energy.

Electricity Share

#> # A tibble: 6 × 3
#>   Entity         avgElectricityFromNuclear avgElectricityShare
#>   <chr>                              <dbl>               <dbl>
#> 1 Armenia                             2.22                33.7
#> 2 Sweden                             48.0                 28.4
#> 3 Hungary                             9.45                27.8
#> 4 Switzerland                        18.8                 25.7
#> 5 Bulgaria                           11.8                 24.5
#> 6 Europe (Ember)                   1167.                  23.6

Here we can see that the top 5 countries with electical shares are Armenia, Sweden, Hungary, Switzerland, and Bulgaria.

5.1.2 Statistical Analysis
#> No trace type specified:
#>   Based on info supplied, a 'bar' trace seems appropriate.
#>   Read more about this trace type -> https://plotly.com/r/reference/#bar

Here we can see that Armenia is the leading country which produced the most electricity share through the years and its followed by Sweden, then Hungary, then Switzerland, and finally Bulgaria.

2.2 Data Summary

Global Nuclear Power Plant

#> Rows: 34936 Columns: 36
#> ── Column specification ────────────────────────────────────────────────────────
#> Delimiter: ","
#> chr (18): country, country_long, name, gppd_idnr, primary_fuel, other_fuel1,...
#> dbl (17): capacity_mw, latitude, longitude, commissioning_year, year_of_capa...
#> lgl  (1): other_fuel3
#> 
#> ℹ Use `spec()` to retrieve the full column specification for this data.
#> ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
#>   country country_long
#> 1     AFG  Afghanistan
#> 2     AFG  Afghanistan
#> 3     AFG  Afghanistan
#> 4     AFG  Afghanistan
#> 5     AFG  Afghanistan
#> 6     AFG  Afghanistan
#>                                                        name    gppd_idnr
#> 1              Kajaki Hydroelectric Power Plant Afghanistan GEODB0040538
#> 2                                              Kandahar DOG   WKS0070144
#> 3                                              Kandahar JOL   WKS0071196
#> 4             Mahipar Hydroelectric Power Plant Afghanistan GEODB0040541
#> 5          Naghlu Dam Hydroelectric Power Plant Afghanistan GEODB0040534
#> 6 Nangarhar (Darunta) Hydroelectric Power Plant Afghanistan GEODB0040536
#>   capacity_mw latitude longitude primary_fuel other_fuel1 other_fuel2
#> 1       33.00  32.3220   65.1190        Hydro        <NA>        <NA>
#> 2       10.00  31.6700   65.7950        Solar        <NA>        <NA>
#> 3       10.00  31.6230   65.7920        Solar        <NA>        <NA>
#> 4       66.00  34.5560   69.4787        Hydro        <NA>        <NA>
#> 5      100.00  34.6410   69.7170        Hydro        <NA>        <NA>
#> 6       11.55  34.4847   70.3633        Hydro        <NA>        <NA>
#>   other_fuel3 commissioning_year owner     source
#> 1          NA                 NA  <NA>      GEODB
#> 2          NA                 NA  <NA> Wiki-Solar
#> 3          NA                 NA  <NA> Wiki-Solar
#> 4          NA                 NA  <NA>      GEODB
#> 5          NA                 NA  <NA>      GEODB
#> 6          NA                 NA  <NA>      GEODB
#>                                  url geolocation_source wepp_id
#> 1 http://globalenergyobservatory.org              GEODB 1009793
#> 2         https://www.wiki-solar.org         Wiki-Solar    <NA>
#> 3         https://www.wiki-solar.org         Wiki-Solar    <NA>
#> 4 http://globalenergyobservatory.org              GEODB 1009795
#> 5 http://globalenergyobservatory.org              GEODB 1009797
#> 6 http://globalenergyobservatory.org              GEODB 1009787
#>   year_of_capacity_data generation_gwh_2013 generation_gwh_2014
#> 1                  2017                  NA                  NA
#> 2                    NA                  NA                  NA
#> 3                    NA                  NA                  NA
#> 4                  2017                  NA                  NA
#> 5                  2017                  NA                  NA
#> 6                  2017                  NA                  NA
#>   generation_gwh_2015 generation_gwh_2016 generation_gwh_2017
#> 1                  NA                  NA                  NA
#> 2                  NA                  NA                  NA
#> 3                  NA                  NA                  NA
#> 4                  NA                  NA                  NA
#> 5                  NA                  NA                  NA
#> 6                  NA                  NA                  NA
#>   generation_gwh_2018 generation_gwh_2019 generation_data_source
#> 1                  NA                  NA                   <NA>
#> 2                  NA                  NA                   <NA>
#> 3                  NA                  NA                   <NA>
#> 4                  NA                  NA                   <NA>
#> 5                  NA                  NA                   <NA>
#> 6                  NA                  NA                   <NA>
#>   estimated_generation_gwh_2013 estimated_generation_gwh_2014
#> 1                        123.77                        162.90
#> 2                         18.43                         17.48
#> 3                         18.64                         17.58
#> 4                        225.06                        203.55
#> 5                        406.16                        357.22
#> 6                         58.77                         54.42
#>   estimated_generation_gwh_2015 estimated_generation_gwh_2016
#> 1                         97.39                        137.76
#> 2                         18.25                         17.70
#> 3                         19.10                         17.62
#> 4                        146.90                        230.18
#> 5                        270.99                        395.38
#> 6                         42.71                         59.72
#>   estimated_generation_gwh_2017 estimated_generation_note_2013
#> 1                        119.50                       HYDRO-V1
#> 2                         18.29                SOLAR-V1-NO-AGE
#> 3                         18.72                SOLAR-V1-NO-AGE
#> 4                        174.91                       HYDRO-V1
#> 5                        350.80                       HYDRO-V1
#> 6                         46.12                       HYDRO-V1
#>   estimated_generation_note_2014 estimated_generation_note_2015
#> 1                       HYDRO-V1                       HYDRO-V1
#> 2                SOLAR-V1-NO-AGE                SOLAR-V1-NO-AGE
#> 3                SOLAR-V1-NO-AGE                SOLAR-V1-NO-AGE
#> 4                       HYDRO-V1                       HYDRO-V1
#> 5                       HYDRO-V1                       HYDRO-V1
#> 6                       HYDRO-V1                       HYDRO-V1
#>   estimated_generation_note_2016 estimated_generation_note_2017
#> 1                       HYDRO-V1                       HYDRO-V1
#> 2                SOLAR-V1-NO-AGE                SOLAR-V1-NO-AGE
#> 3                SOLAR-V1-NO-AGE                SOLAR-V1-NO-AGE
#> 4                       HYDRO-V1                       HYDRO-V1
#> 5                       HYDRO-V1                       HYDRO-V1
#> 6                       HYDRO-V1                       HYDRO-V1

After knowing the top 5 Countries for electricity share and electricity produced from nuclear energy, we can see what other power plant those countries used and see their electricity generation.

#>    country          country_long           name            gppd_idnr        
#>  Length:34936       Length:34936       Length:34936       Length:34936      
#>  Class :character   Class :character   Class :character   Class :character  
#>  Mode  :character   Mode  :character   Mode  :character   Mode  :character  
#>                                                                             
#>                                                                             
#>                                                                             
#>                                                                             
#>   capacity_mw          latitude        longitude        primary_fuel      
#>  Min.   :    1.00   Min.   :-77.85   Min.   :-179.978   Length:34936      
#>  1st Qu.:    4.90   1st Qu.: 29.26   1st Qu.: -77.642   Class :character  
#>  Median :   16.75   Median : 39.73   Median :  -2.127   Mode  :character  
#>  Mean   :  163.35   Mean   : 32.82   Mean   :  -6.973                     
#>  3rd Qu.:   75.34   3rd Qu.: 46.26   3rd Qu.:  49.503                     
#>  Max.   :22500.00   Max.   : 71.29   Max.   : 179.389                     
#>                                                                           
#>  other_fuel1        other_fuel2        other_fuel3    commissioning_year
#>  Length:34936       Length:34936       Mode:logical   Min.   :1896      
#>  Class :character   Class :character   NA's:34936     1st Qu.:1988      
#>  Mode  :character   Mode  :character                  Median :2007      
#>                                                       Mean   :1997      
#>                                                       3rd Qu.:2014      
#>                                                       Max.   :2020      
#>                                                       NA's   :17489     
#>     owner              source              url            geolocation_source
#>  Length:34936       Length:34936       Length:34936       Length:34936      
#>  Class :character   Class :character   Class :character   Class :character  
#>  Mode  :character   Mode  :character   Mode  :character   Mode  :character  
#>                                                                             
#>                                                                             
#>                                                                             
#>                                                                             
#>    wepp_id          year_of_capacity_data generation_gwh_2013
#>  Length:34936       Min.   :2000          Min.   : -947.60   
#>  Class :character   1st Qu.:2017          1st Qu.:    1.95   
#>  Mode  :character   Median :2019          Median :   23.43   
#>                     Mean   :2018          Mean   :  592.70   
#>                     3rd Qu.:2019          3rd Qu.:  199.71   
#>                     Max.   :2019          Max.   :50834.00   
#>                     NA's   :20049         NA's   :28519      
#>  generation_gwh_2014 generation_gwh_2015 generation_gwh_2016
#>  Min.   : -989.62    Min.   : -864.43    Min.   : -768.62   
#>  1st Qu.:    2.26    1st Qu.:    2.66    1st Qu.:    2.73   
#>  Median :   23.61    Median :   26.14    Median :   22.46   
#>  Mean   :  656.86    Mean   :  762.37    Mean   :  693.15   
#>  3rd Qu.:  226.32    3rd Qu.:  285.86    3rd Qu.:  249.87   
#>  Max.   :32320.92    Max.   :37433.61    Max.   :32377.48   
#>  NA's   :27710       NA's   :26733       NA's   :25792      
#>  generation_gwh_2017 generation_gwh_2018 generation_gwh_2019
#>  Min.   : -934.94    Min.   : -982.62    Min.   : -780.34   
#>  1st Qu.:    2.47    1st Qu.:    2.24    1st Qu.:    2.75   
#>  Median :   17.88    Median :   12.53    Median :   11.53   
#>  Mean   :  661.83    Mean   :  517.32    Mean   :  423.92   
#>  3rd Qu.:  214.51    3rd Qu.:  151.12    3rd Qu.:  122.78   
#>  Max.   :36448.64    Max.   :35136.00    Max.   :31920.37   
#>  NA's   :25436       NA's   :25299       NA's   :25277      
#>  generation_data_source estimated_generation_gwh_2013
#>  Length:34936           Min.   :    1.12             
#>  Class :character       1st Qu.:    8.62             
#>  Mode  :character       Median :   27.62             
#>                         Mean   :  239.11             
#>                         3rd Qu.:  106.81             
#>                         Max.   :48675.06             
#>                         NA's   :18816                
#>  estimated_generation_gwh_2014 estimated_generation_gwh_2015
#>  Min.   :    0.87              Min.   :    0.44             
#>  1st Qu.:    8.68              1st Qu.:    8.38             
#>  Median :   28.25              Median :   26.83             
#>  Mean   :  242.43              Mean   :  235.87             
#>  3rd Qu.:  106.98              3rd Qu.:  103.12             
#>  Max.   :58470.77              Max.   :57113.35             
#>  NA's   :18433                 NA's   :17886                
#>  estimated_generation_gwh_2016 estimated_generation_gwh_2017
#>  Min.   :    0.30              Min.   :    0.00             
#>  1st Qu.:    8.32              1st Qu.:    8.18             
#>  Median :   27.56              Median :   37.59             
#>  Mean   :  235.70              Mean   :  716.44             
#>  3rd Qu.:  107.24              3rd Qu.:  229.56             
#>  Max.   :60859.73              Max.   :82810.77             
#>  NA's   :17366                 NA's   :1798                 
#>  estimated_generation_note_2013 estimated_generation_note_2014
#>  Length:34936                   Length:34936                  
#>  Class :character               Class :character              
#>  Mode  :character               Mode  :character              
#>                                                               
#>                                                               
#>                                                               
#>                                                               
#>  estimated_generation_note_2015 estimated_generation_note_2016
#>  Length:34936                   Length:34936                  
#>  Class :character               Class :character              
#>  Mode  :character               Mode  :character              
#>                                                               
#>                                                               
#>                                                               
#>                                                               
#>  estimated_generation_note_2017
#>  Length:34936                  
#>  Class :character              
#>  Mode  :character              
#>                                
#>                                
#>                                
#> 
#> 'data.frame':    34936 obs. of  36 variables:
#>  $ country                       : chr  "AFG" "AFG" "AFG" "AFG" ...
#>  $ country_long                  : chr  "Afghanistan" "Afghanistan" "Afghanistan" "Afghanistan" ...
#>  $ name                          : chr  "Kajaki Hydroelectric Power Plant Afghanistan" "Kandahar DOG" "Kandahar JOL" "Mahipar Hydroelectric Power Plant Afghanistan" ...
#>  $ gppd_idnr                     : chr  "GEODB0040538" "WKS0070144" "WKS0071196" "GEODB0040541" ...
#>  $ capacity_mw                   : num  33 10 10 66 100 ...
#>  $ latitude                      : num  32.3 31.7 31.6 34.6 34.6 ...
#>  $ longitude                     : num  65.1 65.8 65.8 69.5 69.7 ...
#>  $ primary_fuel                  : chr  "Hydro" "Solar" "Solar" "Hydro" ...
#>  $ other_fuel1                   : chr  NA NA NA NA ...
#>  $ other_fuel2                   : chr  NA NA NA NA ...
#>  $ other_fuel3                   : logi  NA NA NA NA NA NA ...
#>  $ commissioning_year            : num  NA NA NA NA NA ...
#>  $ owner                         : chr  NA NA NA NA ...
#>  $ source                        : chr  "GEODB" "Wiki-Solar" "Wiki-Solar" "GEODB" ...
#>  $ url                           : chr  "http://globalenergyobservatory.org" "https://www.wiki-solar.org" "https://www.wiki-solar.org" "http://globalenergyobservatory.org" ...
#>  $ geolocation_source            : chr  "GEODB" "Wiki-Solar" "Wiki-Solar" "GEODB" ...
#>  $ wepp_id                       : chr  "1009793" NA NA "1009795" ...
#>  $ year_of_capacity_data         : num  2017 NA NA 2017 2017 ...
#>  $ generation_gwh_2013           : num  NA NA NA NA NA NA NA NA NA NA ...
#>  $ generation_gwh_2014           : num  NA NA NA NA NA NA NA NA NA NA ...
#>  $ generation_gwh_2015           : num  NA NA NA NA NA NA NA NA NA NA ...
#>  $ generation_gwh_2016           : num  NA NA NA NA NA NA NA NA NA NA ...
#>  $ generation_gwh_2017           : num  NA NA NA NA NA NA NA NA NA NA ...
#>  $ generation_gwh_2018           : num  NA NA NA NA NA NA NA NA NA NA ...
#>  $ generation_gwh_2019           : num  NA NA NA NA NA NA NA NA NA NA ...
#>  $ generation_data_source        : chr  NA NA NA NA ...
#>  $ estimated_generation_gwh_2013 : num  123.8 18.4 18.6 225.1 406.2 ...
#>  $ estimated_generation_gwh_2014 : num  162.9 17.5 17.6 203.6 357.2 ...
#>  $ estimated_generation_gwh_2015 : num  97.4 18.2 19.1 146.9 271 ...
#>  $ estimated_generation_gwh_2016 : num  137.8 17.7 17.6 230.2 395.4 ...
#>  $ estimated_generation_gwh_2017 : num  119.5 18.3 18.7 174.9 350.8 ...
#>  $ estimated_generation_note_2013: chr  "HYDRO-V1" "SOLAR-V1-NO-AGE" "SOLAR-V1-NO-AGE" "HYDRO-V1" ...
#>  $ estimated_generation_note_2014: chr  "HYDRO-V1" "SOLAR-V1-NO-AGE" "SOLAR-V1-NO-AGE" "HYDRO-V1" ...
#>  $ estimated_generation_note_2015: chr  "HYDRO-V1" "SOLAR-V1-NO-AGE" "SOLAR-V1-NO-AGE" "HYDRO-V1" ...
#>  $ estimated_generation_note_2016: chr  "HYDRO-V1" "SOLAR-V1-NO-AGE" "SOLAR-V1-NO-AGE" "HYDRO-V1" ...
#>  $ estimated_generation_note_2017: chr  "HYDRO-V1" "SOLAR-V1-NO-AGE" "SOLAR-V1-NO-AGE" "HYDRO-V1" ...

3.2 Data Preprocessing

Missing Data Variable and Filling

#> [1] 455949
#>                        country                   country_long 
#>                              0                              0 
#>                           name                      gppd_idnr 
#>                              0                              0 
#>                    capacity_mw                       latitude 
#>                              0                              0 
#>                      longitude                   primary_fuel 
#>                              0                              0 
#>                    other_fuel1                    other_fuel2 
#>                          32992                          34660 
#>                    other_fuel3             commissioning_year 
#>                          34936                          17489 
#>                          owner                         source 
#>                          14068                             15 
#>                            url             geolocation_source 
#>                             18                            419 
#>                        wepp_id          year_of_capacity_data 
#>                          18702                          20049 
#>            generation_gwh_2013            generation_gwh_2014 
#>                          28519                          27710 
#>            generation_gwh_2015            generation_gwh_2016 
#>                          26733                          25792 
#>            generation_gwh_2017            generation_gwh_2018 
#>                          25436                          25299 
#>            generation_gwh_2019         generation_data_source 
#>                          25277                          23536 
#>  estimated_generation_gwh_2013  estimated_generation_gwh_2014 
#>                          18816                          18433 
#>  estimated_generation_gwh_2015  estimated_generation_gwh_2016 
#>                          17886                          17366 
#>  estimated_generation_gwh_2017 estimated_generation_note_2013 
#>                           1798                              0 
#> estimated_generation_note_2014 estimated_generation_note_2015 
#>                              0                              0 
#> estimated_generation_note_2016 estimated_generation_note_2017 
#>                              0                              0

From the data above, there seems to be a lot of missing data. But after looking at the missing data based on columns, we won’t need to fix any of the mssing data since we won’t need any of the missing data columns. The columns we will need are ‘country’, ‘country_long’, ‘capacity_mw’ and ‘primary_fuel’.

4.2 Data Exploration

Top Country Electricity Generation and Share

Top Electricity Generation

#>       country country_long                 name  gppd_idnr capacity_mw latitude
#> 10291     FRA       France              ARRIGHI WRI1002687      254.00  48.7872
#> 10292     FRA       France                ASTON WRI1002688      104.00  42.7770
#> 10293     FRA       France              AVIGNON WRI1002689      126.00  43.9760
#> 10294     FRA       France Ablaincourt-Pressoir WRI1024142       14.35  49.8414
#> 10295     FRA       France        Ablainzevelle WRI1024534       10.00  50.1529
#> 10296     FRA       France      Achiet-le-Grand WRI1024086       18.00  50.1327
#>       longitude primary_fuel other_fuel1 other_fuel2 other_fuel3
#> 10291    2.4033          Oil        <NA>        <NA>          NA
#> 10292    1.6770        Hydro        <NA>        <NA>          NA
#> 10293    4.8170        Hydro        <NA>        <NA>          NA
#> 10294    2.8247         Wind        <NA>        <NA>          NA
#> 10295    2.7410         Wind        <NA>        <NA>          NA
#> 10296    2.7791         Wind        <NA>        <NA>          NA
#>       commissioning_year owner                    source
#> 10291                 NA  <NA> RTE Transmission Operator
#> 10292                 NA  <NA> RTE Transmission Operator
#> 10293                 NA  <NA> RTE Transmission Operator
#> 10294                 NA  <NA>    Open Power System Data
#> 10295                 NA  <NA>    Open Power System Data
#> 10296                 NA  <NA>    Open Power System Data
#>                                                                               url
#> 10291 http://clients.rte-france.com/lang/an/visiteurs/vie/prod/parc_reference.jsp
#> 10292 http://clients.rte-france.com/lang/an/visiteurs/vie/prod/parc_reference.jsp
#> 10293 http://clients.rte-france.com/lang/an/visiteurs/vie/prod/parc_reference.jsp
#> 10294   http://data.open-power-system-data.org/renewable_power_plants/2016-10-21/
#> 10295   http://data.open-power-system-data.org/renewable_power_plants/2016-10-21/
#> 10296   http://data.open-power-system-data.org/renewable_power_plants/2016-10-21/
#>           geolocation_source wepp_id year_of_capacity_data generation_gwh_2013
#> 10291                  CARMA 1014623                    NA                  NA
#> 10292                  GEODB 1014631                    NA                  NA
#> 10293                  GEODB 1013202                    NA                  NA
#> 10294 Open Power System Data    <NA>                    NA                  NA
#> 10295 Open Power System Data 1086105                    NA                  NA
#> 10296 Open Power System Data    <NA>                    NA                  NA
#>       generation_gwh_2014 generation_gwh_2015 generation_gwh_2016
#> 10291                  NA               9.619               6.998
#> 10292                  NA                  NA                  NA
#> 10293                  NA                  NA                  NA
#> 10294                  NA                  NA                  NA
#> 10295                  NA                  NA                  NA
#> 10296                  NA                  NA                  NA
#>       generation_gwh_2017 generation_gwh_2018 generation_gwh_2019
#> 10291              35.608                  NA                  NA
#> 10292                  NA                  NA                  NA
#> 10293                  NA                  NA                  NA
#> 10294                  NA                  NA                  NA
#> 10295                  NA                  NA                  NA
#> 10296                  NA                  NA                  NA
#>       generation_data_source estimated_generation_gwh_2013
#> 10291          JRC-PPDB-OPEN                            NA
#> 10292                   <NA>                        295.74
#> 10293                   <NA>                        546.92
#> 10294                   <NA>                            NA
#> 10295                   <NA>                            NA
#> 10296                   <NA>                            NA
#>       estimated_generation_gwh_2014 estimated_generation_gwh_2015
#> 10291                            NA                            NA
#> 10292                        301.46                        352.24
#> 10293                        587.61                        374.82
#> 10294                            NA                            NA
#> 10295                            NA                            NA
#> 10296                            NA                            NA
#>       estimated_generation_gwh_2016 estimated_generation_gwh_2017
#> 10291                            NA                        271.68
#> 10292                        302.29                        352.24
#> 10293                        379.85                        459.52
#> 10294                            NA                         26.24
#> 10295                            NA                         18.28
#> 10296                            NA                         32.91
#>       estimated_generation_note_2013 estimated_generation_note_2014
#> 10291                  NO-ESTIMATION                  NO-ESTIMATION
#> 10292                       HYDRO-V1                       HYDRO-V1
#> 10293                       HYDRO-V1                       HYDRO-V1
#> 10294                  NO-ESTIMATION                  NO-ESTIMATION
#> 10295                  NO-ESTIMATION                  NO-ESTIMATION
#> 10296                  NO-ESTIMATION                  NO-ESTIMATION
#>       estimated_generation_note_2015 estimated_generation_note_2016
#> 10291                  NO-ESTIMATION                  NO-ESTIMATION
#> 10292                       HYDRO-V1                       HYDRO-V1
#> 10293                       HYDRO-V1                       HYDRO-V1
#> 10294                  NO-ESTIMATION                  NO-ESTIMATION
#> 10295                  NO-ESTIMATION                  NO-ESTIMATION
#> 10296                  NO-ESTIMATION                  NO-ESTIMATION
#>       estimated_generation_note_2017
#> 10291             CAPACITY-FACTOR-V1
#> 10292                       HYDRO-V1
#> 10293                       HYDRO-V1
#> 10294             CAPACITY-FACTOR-V1
#> 10295             CAPACITY-FACTOR-V1
#> 10296             CAPACITY-FACTOR-V1
#> [1] "FRA" "DEU" "JPN" "RUS" "USA"
5.2.1 Statistical Analysis
#> # A tibble: 5 × 5
#>   country_long             longitude latitude electricGeneration countryCode
#>   <chr>                        <dbl>    <dbl>              <dbl> <chr>      
#> 1 France                        2.05     44.7               51.3 FRA        
#> 2 Germany                      10.7      50.9               85.6 DEU        
#> 3 Japan                       137.       35.9              413.  JPN        
#> 4 Russia                       61.4      54.7              419.  RUS        
#> 5 United States of America    -94.1      39.0              123.  USA

Here we can see the world map in an interactive plot where we can see the electricity generation with nuclear energy on the top 5 countries which produce the most nuclear energy. Here we can see that Japan and Russia is really similar in electric generation and US in 2nd, France in 3rd, and Germany in 4th.

Top Electricity Share

#>     country country_long                  name  gppd_idnr capacity_mw latitude
#> 329     ARM      Armenia            Armenian-2 WRI1019028         375  40.1805
#> 330     ARM      Armenia              Hrazadan WRI1019027        1110  40.5640
#> 331     ARM      Armenia           New Yerevan WRI1019026         271  40.1152
#> 332     ARM      Armenia Sevan-Hrazdan Cascade WRI1003805         561  40.5078
#> 333     ARM      Armenia                 Shamb WRI1019030         170  39.4743
#> 334     ARM      Armenia            Spandaryan WRI1019031          76  39.6494
#>     longitude primary_fuel other_fuel1 other_fuel2 other_fuel3
#> 329   44.1498      Nuclear        <NA>        <NA>          NA
#> 330   44.7479          Gas        <NA>        <NA>          NA
#> 331   44.4973          Gas        <NA>        <NA>          NA
#> 332   44.7606        Hydro        <NA>        <NA>          NA
#> 333   46.1306        Hydro        <NA>        <NA>          NA
#> 334   45.8500        Hydro        <NA>        <NA>          NA
#>     commissioning_year                                   owner
#> 329               1980          Armenian Nuclear Power Company
#> 330                 NA                  Hrazdan Energy Company
#> 331               2010                            Yerevan  TPP
#> 332                 NA                                RusHydro
#> 333               1978 Ministry of Energy an Natural Resources
#> 334               1989 Ministry of Energy an Natural Resources
#>                                               source
#> 329                                             IAEA
#> 330 Armenia Ministry of Energy and Natural Resources
#> 331 Armenia Ministry of Energy and Natural Resources
#> 332 Armenia Ministry of Energy and Natural Resources
#> 333 Armenia Ministry of Energy and Natural Resources
#> 334 Armenia Ministry of Energy and Natural Resources
#>                                                                           url
#> 329 https://www.iaea.org/PRIS/CountryStatistics/ReactorDetails.aspx?current=2
#> 330                                       http://www.minenergy.am/en/page/532
#> 331                                       http://www.minenergy.am/en/page/531
#> 332                                       http://www.minenergy.am/en/page/534
#> 333                                       http://www.minenergy.am/en/page/533
#> 334                                       http://www.minenergy.am/en/page/533
#>     geolocation_source wepp_id year_of_capacity_data generation_gwh_2013
#> 329              GEODB 1010334                    NA                  NA
#> 330              GEODB 1010329                    NA                  NA
#> 331                WRI    <NA>                    NA                  NA
#> 332              GEODB    <NA>                    NA                  NA
#> 333              GEODB    <NA>                    NA                  NA
#> 334              GEODB    <NA>                    NA                  NA
#>     generation_gwh_2014 generation_gwh_2015 generation_gwh_2016
#> 329                  NA                  NA                  NA
#> 330                  NA                  NA                  NA
#> 331                  NA                  NA                  NA
#> 332                  NA                  NA                  NA
#> 333                  NA                  NA                  NA
#> 334                  NA                  NA                  NA
#>     generation_gwh_2017 generation_gwh_2018 generation_gwh_2019
#> 329                  NA                  NA                  NA
#> 330                  NA                  NA                  NA
#> 331                  NA                  NA                  NA
#> 332                  NA                  NA                  NA
#> 333                  NA                  NA                  NA
#> 334                  NA                  NA                  NA
#>     generation_data_source estimated_generation_gwh_2013
#> 329                   <NA>                            NA
#> 330                   <NA>                            NA
#> 331                   <NA>                            NA
#> 332                   <NA>                        847.52
#> 333                   <NA>                        254.59
#> 334                   <NA>                         91.89
#>     estimated_generation_gwh_2014 estimated_generation_gwh_2015
#> 329                            NA                            NA
#> 330                            NA                            NA
#> 331                            NA                            NA
#> 332                        854.95                        838.63
#> 333                        248.21                        225.18
#> 334                         59.65                        134.22
#>     estimated_generation_gwh_2016 estimated_generation_gwh_2017
#> 329                            NA                       2411.04
#> 330                            NA                       6712.82
#> 331                            NA                       1638.89
#> 332                        846.61                        853.67
#> 333                        234.89                        226.04
#> 334                        106.80                         91.89
#>     estimated_generation_note_2013 estimated_generation_note_2014
#> 329                  NO-ESTIMATION                  NO-ESTIMATION
#> 330                  NO-ESTIMATION                  NO-ESTIMATION
#> 331                  NO-ESTIMATION                  NO-ESTIMATION
#> 332                       HYDRO-V1                       HYDRO-V1
#> 333                       HYDRO-V1                       HYDRO-V1
#> 334                       HYDRO-V1                       HYDRO-V1
#>     estimated_generation_note_2015 estimated_generation_note_2016
#> 329                  NO-ESTIMATION                  NO-ESTIMATION
#> 330                  NO-ESTIMATION                  NO-ESTIMATION
#> 331                  NO-ESTIMATION                  NO-ESTIMATION
#> 332                       HYDRO-V1                       HYDRO-V1
#> 333                       HYDRO-V1                       HYDRO-V1
#> 334                       HYDRO-V1                       HYDRO-V1
#>     estimated_generation_note_2017
#> 329             CAPACITY-FACTOR-V1
#> 330             CAPACITY-FACTOR-V1
#> 331             CAPACITY-FACTOR-V1
#> 332                       HYDRO-V1
#> 333                       HYDRO-V1
#> 334                       HYDRO-V1
5.2.2 Statistical Analysis
#> # A tibble: 5 × 5
#>   country_long longitude latitude electricGeneration countryCode
#>   <chr>            <dbl>    <dbl>              <dbl> <chr>      
#> 1 Armenia          45.1      40.0              409.  ARM        
#> 2 Bulgaria         25.4      42.7              215.  BGR        
#> 3 Hungary          19.2      47.3              349.  HUN        
#> 4 Sweden           16.2      61.7              157.  SWE        
#> 5 Switzerland       8.28     46.7               78.1 CHE

Here we can see the world map in an interactive plot where we can see the electricity generation with nuclear energy on the top 5 countries which share the most nuclear energy. Here we can see that Armenia is 1st, then Hungary, then Bulgaria, then Sweden then Switzerland.

Main Source of Fuel for Top Countries with Most Electricity Generation

#> # A tibble: 6 × 4
#>   country_long primary_fuel count meanPower
#>   <chr>        <chr>        <int>     <dbl>
#> 1 France       Biomass        148      5.36
#> 2 France       Coal             5    715   
#> 3 France       Gas              9    556.  
#> 4 France       Geothermal       1      4.5 
#> 5 France       Hydro          429     45.5 
#> 6 France       Nuclear         19   3323.
#> [1] 5

Here we can see that from the top 5 countries that used the most nuclear energy power also has used nuclear energy as their main power source. Gas and Coal have a similar height in use while other pwer source are low in use. From this we can conclude that Nuclear energy is a good use of a power source and being Coal or Gas as an alternative.

2.3 Data Summary

Nuclear Energy Overview over the Years

#> Rows: 614 Columns: 7
#> ── Column specification ────────────────────────────────────────────────────────
#> Delimiter: ","
#> chr (2): Month, Nuclear Generating Units, Total Operable Units
#> dbl (5): Year, Nuclear Generating Units, Net Summer Capacity, Nuclear Electr...
#> 
#> ℹ Use `spec()` to retrieve the full column specification for this data.
#> ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
#>   Year    Month Nuclear_Generating_Units,_Total_Operable_Units
#> 1 1973  January                                  Not Available
#> 2 1973 February                                  Not Available
#> 3 1973    March                                  Not Available
#> 4 1973    April                                  Not Available
#> 5 1973      May                                  Not Available
#> 6 1973     June                                  Not Available
#>   Nuclear_Generating_Units,_Net_Summer_Capacity
#> 1                                        14.533
#> 2                                        14.533
#> 3                                        15.314
#> 4                                        15.314
#> 5                                        16.174
#> 6                                        18.729
#>   Nuclear_Electricity_Net_Generation
#> 1                               6246
#> 2                               5928
#> 3                               6649
#> 4                               5876
#> 5                               5697
#> 6                               6784
#>   Nuclear_Share_of_Electricity_Net_Generation
#> 1                                         3.9
#> 2                                         4.1
#> 3                                         4.5
#> 4                                         4.2
#> 5                                         3.9
#> 6                                         4.2
#>   Nuclear_Generating_Units,_Capacity_Factor
#> 1                                      57.8
#> 2                                      60.7
#> 3                                      58.4
#> 4                                      53.4
#> 5                                      47.3
#> 6                                      50.3
#>       Year         Month          
#>  Min.   :1973   Length:614        
#>  1st Qu.:1985   Class :character  
#>  Median :1998   Mode  :character  
#>  Mean   :1998                     
#>  3rd Qu.:2011                     
#>  Max.   :2024                     
#>  Nuclear_Generating_Units,_Total_Operable_Units
#>  Length:614                                    
#>  Class :character                              
#>  Mode  :character                              
#>                                                
#>                                                
#>                                                
#>  Nuclear_Generating_Units,_Net_Summer_Capacity
#>  Min.   : 14.53                               
#>  1st Qu.: 78.71                               
#>  Median : 98.53                               
#>  Mean   : 85.61                               
#>  3rd Qu.: 99.63                               
#>  Max.   :102.21                               
#>  Nuclear_Electricity_Net_Generation Nuclear_Share_of_Electricity_Net_Generation
#>  Min.   : 5697                      Min.   : 3.90                              
#>  1st Qu.:31482                      1st Qu.:15.53                              
#>  Median :57362                      Median :18.80                              
#>  Mean   :49806                      Mean   :17.22                              
#>  3rd Qu.:65169                      3rd Qu.:20.10                              
#>  Max.   :74649                      Max.   :22.90                              
#>  Nuclear_Generating_Units,_Capacity_Factor
#>  Min.   : 34.60                           
#>  1st Qu.: 61.02                           
#>  Median : 79.15                           
#>  Mean   : 76.50                           
#>  3rd Qu.: 91.88                           
#>  Max.   :101.60
#> 'data.frame':    614 obs. of  7 variables:
#>  $ Year                                          : num  1973 1973 1973 1973 1973 ...
#>  $ Month                                         : chr  "January" "February" "March" "April" ...
#>  $ Nuclear_Generating_Units,_Total_Operable_Units: chr  "Not Available" "Not Available" "Not Available" "Not Available" ...
#>  $ Nuclear_Generating_Units,_Net_Summer_Capacity : num  14.5 14.5 15.3 15.3 16.2 ...
#>  $ Nuclear_Electricity_Net_Generation            : num  6246 5928 6649 5876 5697 ...
#>  $ Nuclear_Share_of_Electricity_Net_Generation   : num  3.9 4.1 4.5 4.2 3.9 4.2 4 4.4 5 4.9 ...
#>  $ Nuclear_Generating_Units,_Capacity_Factor     : num  57.8 60.7 58.4 53.4 47.3 50.3 50 54.5 56.9 49.8 ...

3.3 Data Preprocessing

Missing Data and Filling

#> [1] 0
#>                                           Year 
#>                                              0 
#>                                          Month 
#>                                              0 
#> Nuclear_Generating_Units,_Total_Operable_Units 
#>                                              0 
#>  Nuclear_Generating_Units,_Net_Summer_Capacity 
#>                                              0 
#>             Nuclear_Electricity_Net_Generation 
#>                                              0 
#>    Nuclear_Share_of_Electricity_Net_Generation 
#>                                              0 
#>      Nuclear_Generating_Units,_Capacity_Factor 
#>                                              0

Data Handling

#>   Year    Month Nuclear_Generating_Units,_Total_Operable_Units
#> 1 2014  January                                            100
#> 2 2014 February                                            100
#> 3 2014    March                                            100
#> 4 2014    April                                            100
#> 5 2014      May                                            100
#> 6 2014     June                                            100
#>   Nuclear_Generating_Units,_Net_Summer_Capacity
#> 1                                        99.182
#> 2                                        99.182
#> 3                                        99.182
#> 4                                        99.182
#> 5                                        99.182
#> 6                                        99.182
#>   Nuclear_Electricity_Net_Generation
#> 1                              73163
#> 2                              62639
#> 3                              62397
#> 4                              56385
#> 5                              62947
#> 6                              68138
#>   Nuclear_Share_of_Electricity_Net_Generation
#> 1                                        19.4
#> 2                                        19.3
#> 3                                        18.8
#> 4                                        18.9
#> 5                                        19.4
#> 6                                        19.0
#>   Nuclear_Generating_Units,_Capacity_Factor
#> 1                                      99.1
#> 2                                      94.0
#> 3                                      84.5
#> 4                                      78.8
#> 5                                      85.2
#> 6                                      95.4

4.3 Data Exploration

Nuclear Generating Units

#>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
#>   55290   62852   65869   66160   69730   74649
#>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
#>   92.00   93.00   98.00   96.52   99.00  100.00

Correlation between Total Electricity Generation and Total Generating Units

#> [1] 0.2141611

Electicity Generation Over the Years

From the graph above, we can see that the electricity produced by nuclear generators have an incline from 2014 - 2019 and after that had a major drop until 2022 and has only went up since 2023. We can assume that the drop was caused by the pandemic where most workers were asked to work hom or don’t work. This caused nuclear generators to stop working or less working. Post COVID gives a good result where more generators seems to be working again.

Generator Units over the Years

From the Graph shown, we can conclude that the operational nuclear generators also dropped from 2019 - 2022 since of COVID-19. There seems to be an incline in operational nuclear generators post COVID (2023-2024) where people can work more.

5.3 Statistical Analysis

T-Test to find out if Electricity Drop is caused by COVID

#> 
#>  Welch Two Sample t-test
#> 
#> data:  pre_covid$meanGeneration and post_covid$meanGeneration
#> t = 3.3727, df = 5.1815, p-value = 0.01877
#> alternative hypothesis: true difference in means is not equal to 0
#> 95 percent confidence interval:
#>   407.6291 2909.4265
#> sample estimates:
#> mean of x mean of y 
#>  66965.03  65306.50

Based on the statistical analysis, there is evidence to suggest that there is a difference in power generation between the pre-COVID and post-COVID periods. The p-value (0.01877) indicates statistical significance, and the confidence interval does not include 0. Post-COVID generation appears to be lower than pre-COVID generation.

Correlation between Average Units and Average Power Generation over the Years

#> [1] 0.7437131

1 = Strong Positive Correlation 0 = No Positive Correlation -1 = Strong Negative Correlation

According to the correlation calculation above we can conclude that number of operational units has a strong positive correlation with the average power electricity generation.

Discussion

Based on our analysis we can see that Nuclear Energy has been used by most countries especially the top 5 countries on our analysis. We now know that the trend of the use of Nuclear Energy has been used most of the time before the pandemic, has dropped drastically during it and is trying to rise up again post pandemic. We also have additional info based on other source of fuel being gas and hydro as an alternative energy source if nuclear energy isn’t working out. Coal and Oil are high up based on our findings but it isn’t a good choice since those fuel source are limited. We should also increase our other fuel sources such as wind and solar fuel as they aren’t limited fuel energy source.

Conclusion

After this whole Analysis, we can conclude that Nuclear Energy is a good power source to use and is a good long term power source to use for the future. There seems to be an incline in Nuclear power use and more countries are using more as a source of power. This analysis can be researched further in the case of in the coming future or maybe in more specific areas around the world or maybe added more analysis surrounding nuclear energy.

References

Data Set Link: link